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ABSTRACT 

Wide field surveys will soon be discovering Type la supcrnovae (SNe) at rates of several thou- 
sand per year. Spectroscopic follow-up can only scratch the surface for such enormous samples, 
so these extensive data sets will only be useful to the extent that they can be characterized 
by the survey photometry alone. In a companion paper (Rodney and Tonry, 2009) we intro- 
duced the SOFT method for analyzing SNe using direct comparison to template light curves, 
and demonstrated its application for photometric SN classification. In this work we extend the 
SOFT method to derive estimates of redshift and luminosity distance for Type la SNe, using light 
curves from the SDSS and SNLS surveys as a validation set. Redshifts determined by SOFT using 
light curves alone are consistent with spectroscopic redshifts, showing a root-mean-square scatter 
in the residuals of RMS Z = 0.051. SOFT can also derive simultaneous redshift and distance 
estimates, yielding results that are consistent with the currently favored ACDM cosmological 
model. When SOFT is given spectroscopic information for SN classification and redshift priors, 
the RMS scatter in Hubble diagram residuals is 0.18 mags for the SDSS data and 0.28 mags for 
the SNLS objects. Without access to any spectroscopic information, and even without any red- 
shift priors from host galaxy photometry, SOFT can still measure reliable redshifts and distances, 
with an increase in the Hubble residuals to 0.37 mags for the combined SDSS and SNLS data 
set. Using Monte Carlo simulations we predict that SOFT will be able to improve constraints on 
time-variable dark energy models by a factor of 2-3 with each new generation of large-scale SN 
surveys. 

Subject headings: methods:statistical — supernovae:general 



The use of Thermonuclear Supcrnovae (TN SNe, 
i.e. Type la SNe) as cosmological standard candles 
requires the determination of each SN's redshift 
and distance modulus. This has traditionally been 
done with two parallel measurements: a spectrum 
confirms the object's class and provides a very ac- 
curate redshift, while a broad band light curve al- 
lows the estimation of luminosity distance. Tech- 
nological advances in wide-field imagers are now 
opening the door for a new generation of large area 



surveys. Proje cts such as the Palomar Transien t 
Factory (PTF; iRau et al.l l2009t lLaw et al.ll2009l) . 
and Pan-STARRS 1 fPSl; lKaiser et al.ll2002l )~are 
already beginning to amass SN samples that will 
eventually number in the thousands. In coming 
years Pan-STARRS 4E LSSlH and JDEM^I will 
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uncover tens of thousands of SNe. From these vast 
collections of SN light curves, only a small fraction 
will be spectroscopically observed, meaning that 
any analyses applied to them will have to rely on 
photometry alone. 

In lRodnev fc Tonrvl (|2009l ) (hereafter Paper I), 
we introduced SOFT (Supernova Ontology with 
Fuzzy Templates) and described in detail the 
structure and form of the SOFT light curve anal- 
ysis method. In this paper, Section [T] begins with 
a brief review of the basis for the SOFT method 
and the fuzzy set theory framework for light curve 
comparisons. In Section [2] we apply the SOFT 
models to the task of estimating cosmological pa- 
rameters using SN light curves. Verification tests 
using data from the Sloan Digital Sky Survey 
(SDSS) and the Supernova Legacy Survey (SNLS) 
are described in Section |3] The results of these 
tests are then extended to produce Monte Carlo 
simulations of future surveys to examine how well 
an approach such as SOFT will be able to con- 
strain dark energy models. A discussion of the 
inherent biases in our template estimators is pro- 
vided in Section [5j and a summary of results is 
presented in Section [5] 

1. Light Curve Models 

The shape of an observed SN light curve can 
be described by two sets of parameters. The first 
set, which we denote consists of all relevant 
physical parameters that determine the intrinsic 
shape and color of the light curve (e.g. the 56 Ni 
mass, the degree of mixing in the stellar interior, 
etc.). A second set of parameters, 9, relate to 
the location of the SN in space and time: redshift 
z, luminosity distance /i e u host galaxy extinction 
Ay, and time of peak t pk . These four location 
parameters modify the intrinsic light curve shape 
by stretching and dimming it into the form that 
we eventually observe. 

We do not have a complete physical model 
for SN explosions that could efficiently and re- 
liably translate a vector of physical parameters 
3? into a precise light curve prediction. There- 
fore, m ost TN SN light c urve fitter s - such as 
MLCS (|Riess et all Il996t Ijhal I2002T) and SALT 



(|Guv et al.l |2005[ 120071) - use one or more para- 
maters (e.g. A, or XI and C) to define the shape 
and color of a broad-band SN light curve. By con- 
trast, the SOFT method has no shape or color pa- 
rameters, but instead uses a collection of SN light 
curve templates to provide examples of possible 
shapes and colors. These template light curves are 
then warped to reproduce the effects of a change 
in location 6. By comparing this adapted light 
curve model against a candidate SN object, we 
can derive the likelihood that the candidate has 
an intrinsic light curve shape similar to the tem- 
plate, and is at the assumed location 6. 

In Paper I we discussed how our template-based 
approach is best ser ved by utilizi ng the framework 
of fuzzy set theory (jZadehl 119651 ). and we applied 
SOFT to the task of SN classification. In this pa- 
per we will now explore how the application of 
fuzzy logic allows us to extend the SOFT method 
to the problem of estimating cosmologically useful 
parameters such as a SN's redshift and distance. 

2. Parameter Estimation 

Suppose we have a candidate object called SN X 
whose observed light curve is given by a series of 
points D, t = (ti,fi,o~i). Here ti is the observation 
time, fi is the measured flux, and &i is the uncer- 
tainty for epoch i. The complete light curve with 
N data points is denoted by the vector D. If we 
assume a set of location parameters 6 then a light 
curve model Mj that realizes a given $ provides 
a prediction for a SN's flux function of time: 
"Fj(ti, 9). 

In the framework of traditional probability the- 
ory, we would describe the probability of observing 
the data D given model Mj and location as the 
combined probability of the data-minus-model er- 
ror terms: 



N 



p ( D| ,M j ) = n^-ex P 



-(/i-^-(ti,e)) s 

2o-1 



4 The SOFT luminosity distance parameter fi e is the dis- 
tance modulus residual relative to an empty universe. See 
equation 3 of Paper I. 



(1) 

(see Equations 4-6 in Paper I). We then compute 
the probability that our candidate object resides 
at any particular location 6 by applying Baycs the- 
orem: 
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p(0|D,M,-) 



J p{e\Mj) p{T>\e,Mj) de 



(2) 



This gives us a probability distribution over the 
four dimensions of 8, under the assumption that 
our candidate SN X is physically identical to 
the model Mj. Variations of this approach us- 
ing parameterized light curv e models have been 
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With SOFT, however, we remove the need for a 
parametric model of the SN light curve by limiting 
ourselves to a finite set of fixed-shape models. To 
keep within the framework of traditional proba- 
bility theory, we could apply Equation [2] to derive 
probabilities from all our models and then com- 
bine the results usi ng a method such as Bayesian 



Model Averaging ( Hoeting et al. 19991 ). Unfor 



tunatcly, that approach is applicable only in the 
case where our set of models is complete and non- 
redundant. If cither of those two conditions is not 
satisfied, then the normalization inherent in ap- 
plying Bayes' theorem is not valid. In Paper I 
we have argued that it is impossible to completely 
satisfy both conditions when using a finite library 
of fixed-shape templates. 

2.1. Biased Estimators 

In addition to problems of logical incongru- 
ence described above and in Paper I, our finite 
set of templates will necessarily introduce an el- 
ement of bias to any parameter estimates. To 
illustrate this problem, suppose we have an ex- 
tremely sparse template library, consisting of only 
two template models, Mi and M-i- Now let us ob- 
serve a candidate object, SN X, which has physi- 
cal parameters <&x that are intermediate between 
€>i and 3?2 (e.g., perhaps the candidate has a 
56 Ni mass that is bracketed by the two templates: 
Mmi,i < Mmi,x < Mpji^)- SN X is not a perfect 
match with either of the two models, so applying 
Eq.[2]will give us a biased parameter estimate from 
each of them. For simplicity suppose that all the 
location parameters are fixed except for the cos- 
mological dimming, 9~fi e . The model M2 with 



a higher M uj will have a higher bolometric flux 
than SN X (jArnettl Il982h . Thus, the best match 
from the Mi model will have to increase the cos- 
mological dimming \i e away from the true value to 
compensate^ Conversely, the fainter M\ model 
must decrease the cosmological dimming (lower 
/i e — > higher flux) to bring it up to the level of 
SN X. Therefore Mi will report a lower /j, e as the 
most likely parameter. 

In Figure [T^, we illustrate this parameter bias 
using a real SN light curve from the SNLS, the 
Type la SN 04D2ja. We compared this object's 
light curve against all 28 TN SN templates, in each 
case using equation [2] to construct a probability 
distribution in the z-[i e plane. The sum of these 
distributions, shown in Figure QJi, is diffuse and 
inaccurate, with multiple peaks spread across the 
parameter space. No individual template provides 
a clear match to the shape of the candidate light 
curve, so each template requires an offset from the 
true location in order to compensate for the shape 
mismatch. 

2.2. Fuzzy SN Models 

We now have two problems arising from the tra- 
ditional (Bayesian) approach to parameter estima- 
tion when using a finite set of templates. First is 
the problem of violating the assumptions of com- 
pleteness and non-redundancy (see section 4 of Pa- 
per I), and second is the issue of parameter esti- 
mation biases described above. We propose that 
both of these problems are resolved by invoking 
fuzzy logic, an alternative logical framework that 
is better suited to the case of our limited template 
library. 

In Paper I we developed the basic structure of a 
fuzzy SN model. Each template model Mj carries 
an associated "model fuzziness" term, er^ , which 
is used to define a fuzzy set around that template. 
The cr'j term quantifies how membership in that 
model's fuzzy set falls off as light curves become 
more and more dissimilar to the template curve: 



5 Recall that /i e is a distance measured in magnitudes, so a 
higher fi e makes the intrinsically bright model M2 appear 
farther away and fainter, bringing it closer in form to the 
less luminous SN X. 
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Fig. 1. — Demonstration of the parameter estimation biases inherent in a finite set of light curve templates, 
and the bias averaging enabled by fuzzy templates. The candidate SN used here was the SNLS object 
04D2ja. (a) Probability distributions constructed using the strict Bayesian approach of Equations [1] and 
[5] (i.e. without any model fuzziness). The 'x' marks the spectroscopic redshift of this SN and the corre- 
sponding distance modulus for a universe with H ,Qm,Qa = (70,0.3,0.7). The gray-scale shading defines 
the probability distribution, and the contour line encloses the region about the peak that contains 68% of 
the probability, (b) The introduction of model fuzziness using a'j as in Section 12.21 expands the area of 
parameter space in which each model can provide a fair match. For this candidate SN, our template library 
finds 5 templates that provide reasonable fits. Here we display the "probability" cloud that defines each 
individual fuzzy set membership function, simply summing them together without using a proper fuzzy set 
combination operator. Each cloud is marked with the designation of its generating template and a percentage 
that indicates its integrated membership grade 7 (Eq. [5j. (c) After rejecting outliers with a membership cut 
at 5%, we combine the remaining fuzzy membership functions using the Dombi intersection operator. This 
"Fuzzy AND" operation tends to average out the biases of any individual template. 



N 

?"< D ">^>= flTBvW (3) 

(cf. Eq.CQand Paper I, Eq. 13). 

It is no accident that this expression for the 
likelihood is functionally identical to what we 
would use in traditional probability theory if a'j 
was a simple model uncertainty There is a con- 
ceptual difference between the two types of un- 
certainty: our model fuzziness represents an in- 
tentional "vagueness" or "indefiniteness," whereas 
the traditional model uncertainty encapsulates an 
"imprecision" or "inaccuracy." In the language 
of set theory, our fuzzy model defines a set that 
includes "All SNe that are physically similar to 
model Mj." This leaves room for our model to 
be applied to objects that we know to be slightly 



different from our template. By contrast, in tradi- 
tional probability theory a model Mj defines a set 
that includes "All SNe that are physically identi- 
cal to model Mj ," and the addition of model un- 
certainty would simply acknowledge that we don't 
know exactly what our model Mj really looks like. 
Although the mathematical form is the same, the 
fuzzy concept of vagueness allows us to adjust the 
strength of a'j to suit our purposes (increasing 
a'j makes the model more inclusive but less pre- 
dictive), whereas a traditional model uncertainty 
should be derived strictly from empirical evidence. 
As we will see in the next section, this model fuzzi- 
ness also leads us to a powerful advantage within 
fuzzy logic, namely that we can apply a fuzzy AND 
combination instead of the less informative OR. 

To convert the likelihood of Equation [3] into 
a fuzzy membership grade we introduce a prior 
on the location parameters, p(9\Mj), as well as 
a prior for the particular template in question, 
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p(Mj). The final membership grade for our can- 
didate SN X in the (6,Mj) fuzzy set is given by: 



g(p\e, Mj) = p(0\Mj) p{Mi) p"(D|0, Mj ) (4) 

(see Paper I, Eq. 14 and section 5.1 for a full de- 
scription of fuzzy membership functions). 

To compare the quality of fit between two or 
more models, we define a scalar quantity 7 as a 
normalized integral over the location parameters 
0. For a single model Mj we have: 





E Jg(D\e,M k )se 

k 



(5) 



where the summation is over all available models. 
The value of jj lies in the range [0,1] and measures 
the candidate object's strength of association with 
model Mj relative to all other models. This term 
can serve as the fuzzy analog for a Bayesian pos- 
terior probability. 

By adding an element of fuzziness to the SN 
models, we have alleviated the logical inconsisten- 
cies of the Bayesian approach when applied to our 
finite template library (see section 4.1 of Paper I). 
We have not, however, removed the inherent bi- 
ases in the parameter estimates of individual tem- 
plates. Returning to our example from the SNLS, 
in Figure QJ> we show a stack of membership func- 
tions from comparison of the candidate SN against 
our fuzzy TN SN models. The five most prominent 
membership function clouds are labeled with the 
templates they originate from as well as a per- 
centage showing the value of 7 from Equation [5] 
The addition of model fuzziness has smoothed the 
peaks of the distribution, but individual member- 
ship functions arc still biased by the mismatch of 
light curve shape between our candidate and the 
templates. We would like to be able to combine 
these membership functions together in a way that 
gives us an improved composite parameter esti- 
mate. In the following section we explore how this 
can be achieved using the structures of fuzzy logic. 

2.3. Fuzzy Combination 

The two principle operators for fuzzy logic can 
be defined in a number of ways. As descr ibed in 
the companion paper, we have adopted the lDornbi 



(|l982j ) operators (see Paper I, Section 5.2, Equa- 
tions 16 and 17), which define how two fuzzy mem- 
bership functions can be combined in a logical in- 
tersection (fuzzy AND) or union (fuzzy OR). To 
choose whether to use the intersection or the union 
operator for our SN models we must consider how 
much interaction between the fuzzy sets is appro- 
priate for the problem. 

Suppose we have done two model comparisons, 
using models Mi and Mi to get two membership 
functions g x = g(D|0,Mi) and g 2 = g{T>\9,M 2 ). 
Let us also suppose that these two models are 
based on SNe that have intrinsically very similar 
physical characteristics: <I?i ~ €>2- If it happens 
that our candidate SN X has underlying physical 
parameters 3?x that are similar to both <l?i and 
<J>2, then we should expect that the location pa- 
rameter estimates from those two models should 
strongly agree. That is, the location parameter bi- 
ases from these two models should be small enough 
that the true location of our candidate, Ox, will 
be simultaneously favored by both M\ and M 2 . 
This argues for the use of the intersection opera- 
tor (Paper I, Eq. 17) when combining g\ with g 2 . 

Now suppose that we carry out a third model 
comparison with model M3 , and let us assume that 
this model has intrinsically very different physi- 
cal parameters $3. The larger separation between 
$3 and &x means that the light curve shapes of 
M3 and SN X will be substantially different, and 
therefore M3 will have to introduce a large 6 bias 
in order to compensate. In this case, we should 
expect that the intersection of 173 = g(D\6, M3) 
with cither 171 or g 2 will not provide a useful esti- 
mate of the true candidate location. Thus, an in- 
tersection operation would give misleading results, 
as the outlier 33 pulls the composite membership 
distribution away from the real value of Ox- 

If we expect that template mismatches such 
as M 3 are included in the group of membership 
functions being combined, then we cannot jus- 
tify using the fuzzy AND operator. If, however, 
we can exclude those outliers from our combina- 
tion group, then the intersection operation is jus- 
tified, and should provide a more precise estimate 
of Ox- The 7 value provides a simple (though 
somewhat crude) metric for this purpose. We can 
exclude templates that have very different light 
curve shapes from our candidate by rejecting any 
models that yield a 7 value below some threshold. 
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For the validation tests described in the following 
section we have used a cutoff of 7 < 

In Figure [TJ; we show the end result of applying 
the fuzzy AND operation to combine membership 
functions for the SNLS object 04D2ja. The re- 
sulting composite membership function has a sin- 
gle peak in the region that all models collectively 
agree upon as a likely location for the candidate 
SN. In the case of 04D2ja, the parameter esti- 
mate from this intersection of membership func- 
tions provides a better estimate than any single 
light curve template could provide. This is true 
because each individual template has a location 
parameter bias that attempts to compensate for 
the mis-match in light curve shape. The length 
and direction of these bias vectors are distributed 
more or less randomly over the space of 6. Thus, 
when many models are combined together the bi- 
ases tend to cancel out. It is important to note 
that if we had not used the crude 5% threshold to 
cut out severe mismatches, then a few of the mod- 
els that are most egregiously biased would have 
dominated the intersection operation, pulling the 
combined distribution off into a distant corner of 
6 space. In Section [5] we will discuss a possible 
method of calibrating the template library that 
could alleviate the need for this threshold cut. 

3. Verification Tests 
3.1. Data Sets 

To test the accuracy and precision of SOFT 
estimates for z and [x e , we used a combination 
of data from the SDSS-II and SNLS programs. 
We use 1 46 spectroscopically co nfirmed Type la 
SNe from iHoltzman et al.l ( 20081) and 71 from the 



SNLS project (jAstier et al 



2006). The SDSS sam- 



ple spans a redshift range from 0.01 to 0.45, while 
the SNLS SNe fall between 0.25 and 1.0. From this 
combined data set of 217 SNe, we have extracted a 



3 The choice of 5% is somewhat arbitrary, though it is con- 
nected to the size of the template library. With 28 light 
curves in our library, if they all had equal net membership 
grades then they would all have jj ~ 3.5%. Thus, setting 
the level at 5% is a moderately restrictive choice, and in 
practice it generally ensures that some 3 to 10 models are 
able to pass through the threshold rejection. If the tem- 
plate library had 50 or 100 light curves, then the distribu- 
tion of membership grades would be more diffuse. In that 
case a lower threshold of 4% or 2% would be appropriate, 
in order to maintain ~3 to 10 contributing models. 



series of subsets that attempt to eliminate objects 
that are most problematic for light curve analysis 
with SOFT. Table Q] lists the six subsets used. 

The SDSS-A subset rejects only one singularly 
peculiar object, SN2005gjQ We define the SDSS- 
B sample using photometric selection criteria: ex- 
cluding ten objects that are classified as core col- 
lapse SNe by SOFT (see Paper I for details on the 
classification procedure), and eight objects that 
have one-sided light curves, lacking either the pre- 
peak or the post-peak epochs entirely^ The final 
SDSS subset, SDSS-C, uses an additional cut to 
remove 11 more objects that were given a spectro- 
scopic classification of "la?" by IHoltzman et al 
(2008 ), indicating some classification uncertainty. 



The SNL S SNLS-A includes all 71 SNLS SNe 
presented by lAstier et al. ( 2006 ). We do not apply 
a photometric cut to the SNLS data in the manner 
of SDSS-B, because SOFT classifies all 71 SNe as 
Type la, and the peak is well defined for all light 
curves. For the SNLS-B sample we exclude 14 
objects that were given a "la*" classification in 
Astier et al.l (|2006j ) to indicate some classification 
ambiguity. 

Our final data set is a combination of the photo- 
metrically selected samples from SDSS and SNLS. 
With 127 SNe from SDSS-B and 71 SNe from 
SNLS-A, the Combo subset contains 198 objects 
with complete light curves that SOFT recognizes 
as Type la. 

3.2. Priors 

For each SN in the verification data sets, we 
want to use Eq. [3] to define fuzzy set member- 
ship functions. To compute the location prior 
term in that equation, p(6\Mj), we use flat un- 
informative priors for the parameters t pk and \x e . 
For host galaxy extinction we apply the "Galac- 
tic Line of Sigh t" Ay prior employed by the 
ESSENCE team (IWood-Vasev et al.ll2007l) . which 
is based on the host ga laxy extin c tion m odels 
of lHatano et all (|l998l ). IComminsI (|2004h . and 



7 This was an extremely lu minous SN that appea rs to be a 
hybrid Type Ia/IIn object l lAldering et al.ll2006r) . SOFT is 
unable to match a light curve to this object using any of 
the Type la templates it has available, but finds that the 
best fitting template is SN 1994Y, a Type Iln object. 

8 Note that the SDSS-B subset is not truly a photometrically 
selected sample, because spectro scopic criteria were use d to 
define the original set of 146 by IHoltzman et al. (2008). 
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Table 1 
Verification Data Subsets 



Data set 


N S Ne 


Description 


SDSS-A 


145 


all SNe from Holtzman et al. f2008) except SN05ei (unfittable) 


SDSS-B 


127 


photometric cut : -10 classified by SOFT as CC; -8 with one-sided light curves 


SDSS-C 


116 


photo-cut + spec, cut : -11 with spectroscopic classification "la?" 


SNLS-A 


71 


all 71 SNe from Astier et al. (2006) 


SNLS-B 


57 


spectroscopic cut : -14 with spectroscopic classification "la*" 


Combo 


198 


photometric cuts : union of SDSS-B with SNLS-A 



Riello k Patatl (|2005l ) (see Paper I, §3.1). 

For the rcdshift prior we consider two alterna- 
tive situations. In the first case, we assume a tra- 
ditional SN survey strategy, where all SNe have 
spectroscopic measurements to define their red- 
shifts and to aid in classification. The redshift 
prior for this scenario is based on the spectro- 
scopic mea surements reported by iHoltzman et al 



(120081 ) and lAstier et all (|200fih for the SDSS and 
SNLS data sets, respectively. The prior is sepa- 
rately defined for each SN as a Gaussian centered 
on the measured redshift with width defined by 
u z = 0.005. The parameter estimates that SOFT 
returns in this case can be more or less directly 
compared to the SN cosmology results elsewhere 
in the literature. 

Our second redshift prior is designed to evaluate 
how well SOFT can measure redshift and distance 
without any spectroscopic information. As a worst 
case scenario we assume that there is no spectro- 
scopic information available for any of the SNe or 
their host galaxies, and there are not even pho- 
tometric redshift estimates from the host galax- 
ies. For this case we want to use an uninforma- 
tive redshift prior, reflecting our total ignorance of 
each object's redshift. One possibility would be a 
flat distribution over z, normalized so that it in- 
tegrates to unity over the range of redshifts under 
consideration for each SN candidate. A slightly 
more realistic prior would take into account the 
increasing volume of space as a function of red- 
shift. For simplicity we assume a a flat Qm = 
universe so that the volume of a shell at redshift 
z (per unit solid angle) is 



p(z)dz (x z 2 dz 

This is the uninformativc prior that wc use 
throughout this paper, although we also tested 
the flat z prior and found no significant changes 
in our conclusions. 



3.3. Photometric Redshifts 

As a first test, we want to evaluate how well 
SOFT can determine a redshift based on the SN 
light curve alone. For this task we use the Com- 
bined data set, containing 198 photometrically 
identified Type la SNe from subsets SDSS-B and 
SNLS-A (see Table[T]). Using the uninformative z 2 
redshift prior, we determine a composite member- 
ship function from each TN SN model using Equa- 
tion [4j The membership functions are marginal- 
ized over all nuisance parameters (Ay,t pk ,^ e ) and 
combined using the fuzzy AND operator to get 
a composite membership function over redshift: 
g{z). We locate the peak of this distribution to 
define zsoft, and assign error bars by measuring 
the width of the contour that contains 68% of the 
area. 

Figure [2] plots these SOFT redshift values 
against the spcctroscopically measured redshifts 
from SDSS and SNLS. The residuals defined by 
Zsoft — z spec have an RMS scatter about zero of 
0.045 for the SDSS-A sample, 0.059 for SNLS-B, 
and 0.051 for the combined sample. The uncer- 
tainties for these data points are decidedly non- 
Gaussian and asymmetric. Nevertheless, the re- 
duced x 2 term gives a rough measure of the ac- 
curacy of our uncertainty estimates, and we find 
chi 2 /DOF = 0.98, 1.25, and 1.08 for the SDSS- 
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spectroscopic redshift 

Fig. 2. — Comparison of the peak redshift esti- 
mates from SOFT against the spectroscopic red- 
shift measurements from SDSS and SNLS. The 
SDSS SNe are plotted as (blue) squares, and SNLS 
objects are (magenta) circles. The solid line shows 
how the data would fall if SOFT photo-z's traced 
the spectroscopic redshifts perfectly. The RMS 
scatter about the line is 0.048, and the reduced x 2 
statistic is 1.51. 



A, SNLS-B, and combined samples, respectively. 
These values being close to unity suggests that 
the SOFT error bars are accurately reflecting the 
true redshift uncertainties. 

An appro priate compar i son fo r these results is 



the work of lOvaizu et al 
tometric redshift errors 



(|2008D . in which pho- 
from ugriz imaging of 



~ 300,000 SDSS galaxies are evaluated across a 
similar range of redshifts. Oyaizu et al. found 
that the RMS scatter in photometric redshift er- 
rors (Sz = Zphot — z spec ) for the set of all galaxies 
in their validation set was approximately 0.054. 
They also quantify the photo-z accuracy using cr 68 , 
which measures the range of the 6z distribution 
that contains 68% of their validation set objects. 
The distribution of SDSS galaxy photo-z errors is 
more sharply peaked than a Gaussian distribution, 
but with larger tails. The metric is less sen- 
sitive to the fat tails, so it shows a significantly 
tighter correlation to the spectroscopic redshifts, 
with a 68 fn 0.021. 



From this comparison we see that the galaxy 
photo-z's and the SOFT light curve redshifts can 
be very complementary techniques for doing SN 
science without spectroscopy. Suppose a SN is de- 
tected and a galaxy is identified as a likely host. 
We can derive a photo-z from optical photometry 
of the host, and also determine a completely inde- 
pendent photometric redshift using SOFT analy- 
sis of the SN light curve. In most cases the galaxy 
photo-z will be more precise (i.e. it lies in the 
sharp peak of the photo-z residual distribution). 
However, if the photo-z error bar is large, then we 
should take that as evidence that the SOFT red- 
shift is more reliable (i.e. the host photo-z may be 
a significant outlier, off in the fat tails of the resid- 
ual distribution). If the photometric redshift of 
the presumed host galaxy strongly disagrees with 
the SOFT redshift, then we should be suspicious 
that the galaxy may be a chance superposition and 
may not actually be the SN's host. Finally, in 
cases where the host galaxy is too faint for detec- 
tion, the SOFT light curve redshift can serve as a 
reliable stand-in. 

In practice, host galaxy photo-z's can be fed 
into the SOFT program as a redshift prior, in 
exactly the same way as spectroscopic redshifts. 
SOFT will naturally balance that host galaxy 
prior against the posterior information provided 
by the SN light curve - just as it is done in purely 
Bayesian approaches. When using priors drawn 
from host galaxy redshifts (be they photometric or 
spectroscopic), some care must be taken to avoid 
introducing a redshift bias. If the redshift distribu- 
tion of the galaxies is significantly different from 
the redshift distribution of the SN sample being 
studied, then the likelihood of chance superposi- 
tions is increased. For exa mple, the photo-z cata- 
log of lOvaizu et al.l (|2008h is primarily comprised 
of z < 0.4 galaxies, so if that catalog were used to 
provide redshift priors for a survey like the SNLS 
then it would lead to underestimated redshifts for 
all the high-z SNe. 

3.4. Photometric Cosmology 

In addition to the photometric redshift verifi- 
cation test, we have also evaluated the ability of 
SOFT to provide simultaneous estimates of red- 
shift and distance, for testing cosmological mod- 
els. From each SDSS and SNLS light curve we 
derived a composite membership function in the 
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(z,/i e ) plane, in the manner depicted in Figure Q] 
From the resulting composite membership func- 
tion, we extracted an estimate for z and /z e by 
locating the peak of the composite membership 
function. To estimate the uncertainty we mea- 
sured the height and width of the "1-a contour," 
which we define as the border of the smallest con- 
tiguous region about the peak that contains 68% of 
the integrated value of the composite membership 
function. Figure QJ: shows an example of the 1-a 
contour from a composite membership function. 

In Figure [3] we show a Hubble diagram con- 
structed with SOFT parameter estimates from all 
146 SDSS SNc, using spectroscopic redshift priors. 
As described in the caption, different symbols indi- 
cate the objects that are excluded by photometric 
and spectroscopic cuts when culling down to the 
SDSS-A, B, and C subsets. Error bars have been 
withheld for clarity, so a representative error bar 
is shown in the lower right. Overlaid on the data 
points is a dashed line representing the ACDM cos- 
mological model that is currently favored by ob- 
servational data, with Hq = 70.5 fcms -1 , £Im = 
0.273, and il A = 0.726 (jKomatsu et al.ll2009l ). 
Note that this model is presented here only as a 
metric for evaluating the SOFT results, and it was 
not generated by fitting to these data. Although 
the SOFT parameter estimates could be used to 
determine the best-fitting cosmological model for 
these particular data, such analysis is beyond the 
scope of this paper. The Hubble diagram in Fig- 
ure H shows the SOFT results from all 71 SNLS 
SNe, also computed using spectroscopic redshift 
priors. 

To quantify the comparis on of our SOFT pa- 
rameter estimates against the lKomatsu et al . 2009 
model, we compute the x 2 goodness-of-fit statistic. 
For the distance modulus uncertainty we use the 
quadratic sum of the positive and negative /x e un- 
certainties and a redshift uncertainty component: 



0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 



2 _ 2 

/./ /^,pos 



(6) 



For simplicity, we have projected the redshift un- 
certainty of each object onto the /i e axis using 



(1+f) 
Zn(10) J z(l + z/2) 



(7) 
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Fig. 3. — Hubble diagram showing SOFT pa- 
rameter estimates from the SDSS verification data 
set, using spectroscopic redshift priors. All 146 of 
the SDSS-II SNe survey are plotted, with differ- 
ent symbols to indicate the classification cuts that 
define SDSS subsets A, B and C. Median error 
bars for [i e are shown in the lower right. The 16 
SNe with ambiguous spectroscopic classifications 
(the "Type la?" objects) are plotted as (blue) 
squares. The (cyan) triangles indicated 10 ob- 
jects that SOFT classifies as Core Collapse SNe 
(5 of these are also in the "la?" set). An addi- 
tional 8 objects that have one-sided light curves 
are plotted as (purple) diamonds. The X symbol 
marks SN2005gj, a super luminous "Type Ha" ob- 
ject, which SOFT is unable to match with any 
Type la light curve. It is plotted at the posi- 
tion that SOFT would assign based on the best-fit 
template, which is the Type Iln SN 1994Y. In the 
lower panel we show the residual distance mod- 
ulus values as a function of redshift. Goodness 
of fit statistics for the comparison of these data 
against the ACDM reference model are provided 
in Table El 

sum of positive and negative redshift errors de- 
rived from the la contours. 

In addition to the reduced % 2 test, we measure 
the scatter about the model by computing the root 
mean square Hubble residual values: 



9 



0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 




0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
redshift 



0.2 



0.4 



0.6 



1.0 



44 
43 
42 

f 41 

.E 40 

39 
38 
37 

1 

QJ 

1 ° 

-1 



□ ^ ft' 




m a G 
^ O 151 □ 



median errors: 



° SNLS 
» SDSS 



□ □ 



■ Bcl TP- rff ,-o eL >' 51 -* - W - " 

i"S - n □ "° a □ 

„ o □ o n □ 

I I I l_ 



0.2 



0.4 0.6 
redshift 



0.1 



1.0 



Fig. 4. — Hubble diagram showing SOFT param- 
eter estimates of the SNLS verification data set, 
as in Fig. [3J For 14 objects with uncertain spec- 
troscopic classifications (the "Type la*" SNe) we 
plot (magenta) squares. A median error bar is 
shown in the lower right, and residuals are plotted 
in the lower panel. Goodness of fit statistics from 
the comparison of these data against the ACDM 
reference model are given in Table [21 



Fig. 5. — Hubble diagram from the Combined ver- 
ification data set (see Tabled]). All redshifts and 
distances are derived by SOFT alone, using the 
uninformative z 2 redshift prior. Objects with one- 
sided light curves and those which SOFT classifies 
as core collapse SNe have been rejected, but no 
spectroscopic classification cuts were used. Good- 
ness of fit statistics from a comparison against the 
ACDM reference model are shown in Tabled 



RMSu 



i 

N 



(8) 



where N is the number of SNe in the sample. A 
summary of these fit quality statistics for all data 
subsets is provided in Table [2] 

We can see from Figures [3] and [4] and from 
Table [5] that the photometric and spectroscopic 
cuts are doing a good job of removing serious out- 
liers: the x 2 an d RMS statistics improve each 
time a set of suspect SNe is removed. When spec- 
troscopic redshift priors are included, the RMS 
scatter from SOFT approaches the precision that 
can be achieved wi th other light curve fitters. 
As a recent example, Kessler et al.l ( 2009t ) applied 
MLCS2k2 and SALT-II to approximately the same 
data. Using 103 objects from the SDSS sample 
(comparable to our SDSS-C subset), Kessler et al. 
found RMSf, = 0.15 mags when using MLCS2k2. 



With 62 SNe from the SNLS sample (similar to 
our SNLS-B), they get RMS M = 0.24 mags@ 

As is to be expected, when SOFT has no spec- 
troscopic information to use for weeding out mis- 
classifications and for defining redshift priors, the 
resulting fit statistics are degraded. However, even 
with no spectroscopic help, SOFT is still able 
to produce consistent and reliable redshift and 
distance estimates across the entire range of z. 
The spectroscopy- free Hubble diagram of Figure [S] 
shows that the increased RMS scatter is not be- 
ing driven by large systematic shifts. The fact 
that reduced \ 2 statistics in the lower half of Ta- 
ble [2] are close to unity indicates that the SOFT 
uncertainty estimates are doing a good job of rep- 
resenting the true error in the z and fx e parameter 
estimates. Recall that our redshift priors for this 



9 See Table 11 of Kes sler et all j 20Qgft . but note that the xl 



values that they report are calculated with an additional 
0.16, which wc have not duplicated. 



internal error of t 
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Table 2 

Hubble Diagram : Goodness of Fit Statistics 



data set xl N DOF \ 2 /N RMS^ 
with spectroscopic priors 



SDSS-A 


504.5 


142 


3.55 


0.23 


SDSS-B 


398.3 


124 


3.21 


0.21 


SDSS-C 


178.1 


113 


1.58 


0.18 


SNLS-A 


135.8 


68 


1.99 


0.31 


SNLS-B 


93.5 


54 


1.73 


0.28 



no spectroscopic information 

SDSS-A 272.8 142 1.92 0.43 
SDSS-B 141.1 124 1.14 0.38 

SNLS-A 96.4 68 1.42 0.35 

Combined 238.5 195 1.22 0.37 



spectroscopy-free cosmology test were constructed 
under the assumption of a "worst-case scenario," 
with no host galaxy photo-z estimates at all. In 
reality, many SNe from future wide-field surveys 
will have good photometric rcdshifts from their 
host galaxies, which can significantly improve the 
SOFT parameter estimates. Those future wide- 
field surveys will also have much larger data sam- 
ples, helping to reduce the parameter estimate un- 
certainties that arise from random effects. As the 
sample size increases, however, systematic uncer- 
tainties will begin to dominate. In Section [5] we 
will consider potential modifications of the SOFT 
method that may help to minimize systematic er- 
rors. 



a, and therefore varies over cosmic time. A sim- 
ple and commonly used par ameterization for w(a) 
is to assume a line ar form (jChevallier fc Polarski 
20011: lLinderll2003h : 



w(a) = wq + w a (l — a) 



(9) 



For our simulations we add the additional restric- 
tion of a flat universe (f2k=0). The Fricdmann 
equation in this case is: 



H(z) 
Ho 



f2 M (l + z) 3 + 

(1 - Sl M ) (1 + z)^ 1+w "+ w ^ e-*™**] 1 ' 2 

(10) 



4. Variable Dark Energy 

One of the primary reasons for developing a tool 
such as SOFT is to deal with the extremely large 
SN data sets from a new generation of synoptic 
surveys. A principal science goal for these next 
generation surveys is to improve constraints on a 
time-variable dark energy (DE) equation of state. 
In this section we ask: How much of an improve- 
ment can be made with the precision provided by 
SOFT? 

As a foundation for our simulations we must 
first adopt a DE model in which the equation of 
state parameter, w, is a function of the scale factor 



4.1. Bootstrap Monte Carlo 

To investigate the informative power of future 
SN data sets, we use a series of Monte Carlo sim- 
ulations in which the real SOFT results from our 
validation tests serve as seed points for creating 
larger simulated data sets. Our goal is to mimic 
the observational data set that might be produced 
by PS1, DES, LSST, or another large and deep 
SN survey. The SDSS and SNLS data provide a 
good baseline for simulating those future surveys 
because they have similar light curve sampling and 
cover a similar redshift range. 

We create nine Monte Carlo data sets, la- 
beled A through I, using the time-varying DE 
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model with cosmological parameters fixed at 
(0 M ,0 D E,wo,Wa) = (0.3,0.7,-0.5,-1.0). The 
nine data sets are distinguished according to the 
number of SNe in the sample, and the source 
of redshift information for each object. The A, 
B and C data sets have 500 SNe; D, E, and F 
have 2000; while G, H, and I have 10000 objects. 
The first column (A,D,G) assumes an uninforma- 
tive z 2 prior for the redshift. The second column 
(B,E,H) uses (simulated) host-galaxy photometric 
redshifts. The final column (C,F,I) uses spectro- 
scopic redshift priors. 

The process of "bootstrapping" a cloned obser- 
vation out of the seed data is as follows: 

1. Draw a "seed" SN at random from the 
SDSS+SNLS data. 

2. Assume that the spectroscopic redshift, 
•ZspeC) is equal to the "true" redshift, ztrue, 
and use cosmo l ogical parameters from the 



Komatsu et al. (l2009h model to determine 



the "true" distance modulus /^true ■ 

3. Collect the SOFT results (zsofTjMsoft) for 
the seed object, using the appropriate red- 
shift prior (z 2 , host photo-z, spec-z). 

4. Determine the error on the redshift and dis- 
tance: Z e rr = ^SOFT ~ ^TRUE and fJ, eII = 
A*SOFT — MTRUE • 

5. Generate a clone point at a similar redshift 

ZCLONE = ZTRUE + ^OFFSET, where ^OFFSET 

is drawn from a normal distribution with 
standard deviation a = 0.05. 

6. Determine the clone's true distance modu- 
lus /icLONE based on its redshift, using the 
variable DE model with wq = —0.5 and 
w a = -1.0. 

7. Add an offset to z and fi to account for ob- 
servational error. These offsets are drawn 
from normal distributions centered on the 
SOFT error values z eTT and ^ orr from step 
4, with the distribution width reflecting the 
SOFT uncertainty values: a z = <5zsoft and 
°> = <^SOFT- 

8. Generate an error bar for the clone point 
in both the z and the fi e directions using 



the SOFT errors from the seed object, aug- 
mented by a random adjustment drawn from 
normal distributions with a z — e • <5zsoft 
and o^i = e • ^Msoft, where e = 0.05, 0.1, 
and 0.2 for the z 2 , photo-z, and spec-z pri- 
ors, respectively. 

At the end of this process we have a new cloned 
data point, which has been "observed" at a po- 
sition ZobsiMobs- This redshift and distance pair 
(along with associated uncertainties) incorporates 
the desired variable DE model, and also reflects 
the real errors and observational uncertainties of 
the original seed object. The sequence is repeated 
N times (where N=500, 2000, or 10000) to gener- 
ate a complete Monte Carlo sample. With each 
bootstrap Monte Carlo data set we then fit for 
cosmological parameters using the variable DE 
model. 

In addition to the simulated SN data, these cos- 
mological fits are constrained by observations of 
Baryon Acoustic Oscillations (BAO) and the Cos- 
mic Microwave Background (CMB). We use the 
BAO constra int on the sound bar rier r s /Dy from 
SDSS DR7 (|Percival et all |2010|) and the CMB 
constraint on the "shi ft parameter" R from the 
7-year WMAP results (jKomatsu et al.ll2010h . 



4.2. Results 

The likelihood contours from all nine Monte 
Carlo simulations arc shown in Figure |6] and the 
maximum likelihood values are summarized in Ta- 
bled The error contours and quoted uncertainties 
reflect statistical errors only. As is to be expected, 
the precision of the DE constraints improves as the 
sample size is increased, and as the redshift prior is 
strengthened. A convenient quantity for compar- 
ing the DE constraints from different experiments 
or simulations is the Figure of Merit (FoM) pro- 
posed by the Dark Energy Task Force (DETF). 
The DETF FoM is defined as the reciprocal of the 
area in the wq — w a pla ne that encloses the 95% 
confidence limit region (jAlbrecht et al.ll2006t) . In 
Table [3] we report the FoM values achieved with 
from the SN data alone, as well as the FoM from 
combined SN+CMB+BAO constraints. Both val- 
ues are normalized to the corresponding FoM from 
simulation C. This simulation, with spectroscopic 
constraints from 500 SNe, roug hly corresponds to 
the current state of the art (cf iDavis et al. 2007 : 
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Table 3 

Dark Energy Monte Carlo Simulation Results 







SOFT z 2 


Redshift prior source 
HOST photo-2 




SPEC-* 


500 


A 


wo = -0.67 ±° ;lt 

wa = -0.86 tlfi 

FoMsn = 0.59 
FoMall = 0.81 


B 


w = -0.61 t° f 8 
Wa = -1.02 tlf a 
FoMsn = 0.82 
FoMall = 0.96 


C 


wo = -0.54 tofi 

w a = -1.36 tlii 

FoMsn = 1.0 
FoMall = 1.0 


2,000 


D 


wo = -0.57 t° f s 

Wa = -1.06 04 

FoMsn = 1.75 
FoMall = 1.56 


E 


wo = -0.48 t°of 
Wa = -1.48 t\il 
FoMsn = 2.74 
FoMall = 1.97 


F 


w = -0.42 t°of 
Wa = -1.82 tlH 
FoMsn = 3.62 
FoMall = 2.21 


10,000 


G 


w = -0.42 t°;Jl 
wa = -1.48 ±°;£ 
FoMsn = 8.21 
FoMall = 4.51 


H 


wo = -0.48 +g;S| 

,,. _ _l 17 +0-67 
™o — J-- 1 ' -1.03 

FoMsn = 14.47 
FoMall = 7.38 


I 


wo = -0.46 tVol 
w a = -1.36 t° f 4 
FoMsn = 18.60 
FoMall = 8.67 



'Cell values indicate the label of the simulation set (A-I), the maximum likelihood 
values for the DE parameters wo and w a , and the DETF Figure of Merit (FoM). The 
FoMsn value is derived from only the SN constraints, while the FoMall uses the combined 
SN+CMB+BAO constraints. Both values are normalized to the results from simulation 
C, which approximates the current SN sample. Figure[6]for images of the confidence limit 
contours. 
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Fig. 6. — Likelihood contours in (wo,w a ) space for the dark energy "Bootstrap Monte Carlo" simulations, 
based on data from SDSS and SNLS. Filled contours (in blue) show the 95% and 99.9% confidence boundaries 
from the SN data alone. Unfilled contours (in red and orange) show the combined constraint from SN, CMB, 
and BAO observations. A cross in each panel marks the location of the input cosmological parameters 
wo = —0.5 and w a = —1.0, which were used to generate the simulated observations. Each row (ABC, DEF 
and GHI) shows a set of simulations with 500, 2000, and 10000 SNe, respectively. The columns show results 
from using three different sources for the redshift priors. The first column (ADG) uses only the very weak 
z prior. In the second column (BEH) we assume every simulated SN has a photometric redshift estimate 
from its host galaxy, and in the third column (CFI) we apply a strong spectroscopic prior. 
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Sollerman et~aj]|2009l: iKomatsu et aT1l2009l [201oh . 

The new up-and-coming surveys such as PS1, 
the Palomar Transient Factory, and Skymapper 
are best represented by simulations D and E, for 
which the FoM is improved by about a factor of 2. 
Not an insignificant gain, to be sure, but one that 
could easily be washed out by systematic uncer- 
tainties if they are not carefully controlled. The 
next leap forward, to be realized in the next decade 
by Pan-STARRS 4, JDEM and LSST, will bring 
the SN sample size to 10,000 or more objects, as 
in simulations G and H. 

Using the DETF FoM as our principal barom- 
eter, our simulations suggest that a spectroscopy- 
free analysis using SOFT should be sufficient to 
capitalize on the very large SN samples coming 
in the next decade. With each new generation of 
SN surveys the sample size increases by a factor 
of 4-5, and the SOFT-based FoM increases by a 
factor of 2-3. A close look at the uncertainties 
on Wq and w a in Table [3] highlights the fact that 
systematic effects will quickly become a dominant 
source of uncertainty. For a sample size of 2,000 
objects the statistical uncertainty on wq from our 
Monte Carlo simulations is around ±0.2, which is 
already comparable to the expected level of sys- 
tematic uncertainty. 

How can these results inform our strategic 
choices for new and upcoming SN cosmology 
projects? We would argue that the new surveys 
can be successful relying on light-curve-based red- 
shifts derived by SOFT or similar programs, in- 
formed by host galaxy photo-z's when possible. 
For spectroscopic follow-up, instead of targeting 
the most likely TN SNc, cosmological analyses will 
be better served by focusing resources on objects 
that are given an ambiguous classification grade 
by SOFT. Most of these objects will be SNe of 
type Ib/c or la, perhaps with unusual colors or 
anomalous luminosities that make them difficult 
to classify. To the extent that spectroscopic mea- 
surements are able to provide a more definitive 
classification, we can sharpen SOFT's ability to 
discriminate such marginal cases and help to re- 
duce systematic errors from misclassifications. 

In spite of the close connection to real data, the 
results of our bootstrap Monte Carlo approach are 
likely to be overly optimistic. We are necessarily 
overlooking the effects of sample selection and SN 
classification, and we have knowingly omitted sys- 



tematic uncertainties from our analysis. In order 
to realize these optimistic predictions, it will be 
necessary to make adjustments and improvements 
to the SOFT program itself that will improve its 
precision and reduce systematic effects. In the fol- 
lowing section we provide an outline for that future 
development. 

5. Bias Corrections 

As discussed in Section [5J our use of fixed shape 
light curve templates introduces an inherent bias 
into the parameter estimates that can be derived 
from each individual template. In this work we 
have addressed this problem by using the fuzzy 
intersection operator when combining membership 
functions. The fuzzy AND emphasizes collective 
agreement from all contributing templates, and 
therefore generally causes the individual biases to 
cancel out. In this section we consider two sep- 
arate approaches that might reduce the effects of 
individual template biases by detecting and pre- 
emptively correcting for those location parameter 
offsets. 

5.1. Template Calibration 

Figure [7] illustrates how one could use a SN with 
a known location to measure the size and direc- 
tion of template biases. The redshift and distance 
for our supposed calibration SN is marked by the 
"x." After a SOFT comparison of our calibration 
SN against the model Mi (based on SN 1999ee 
in Figure [7|), we can measure the peak location 
of the Mi cloud as (zi,/x e ,i). Knowing the true 
location of the SN, we can then measure the dis- 
tance from the cloud to the true position of the SN 
as (Azi, A/i ej i). This immediately gives us a bias 
correction vector /3i = (— Azi, — A/x e> i) that can 
be used to translate the entire probability cloud 
down to the true location of the peak. We can 
compute similar correction vectors for model Mi 
(as shown in Figure [7]) and all other models. 

Unfortunately, once we derive a set of bias cor- 
rection vectors using a particular calibration ob- 
ject, we can only apply them to other candidate 
SNe that are basically identical to the calibration 
object. If we have a candidate SN that is physi- 
cally more similar to the model M\ than our cali- 
brator, then the /3i vector shown in Figure [7] would 
be an over-correction. To avoid excessive bias cor- 
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Fig. 7. — Illustration of the bias correction vectors 
f3j that could be derived from a calibration of the 
SOFT template library. The background image 
shows the fuzzy membership functions for SNLS 
04D2ja (as seen in Figure [1} from two separate 
template comparisons: SN 1999ee and 1998aq. 
The integrated membership grade 7 for each tem- 
plate is listed as a percentage. The goal of a tem- 
plate calibration process would be to derive a sep- 
arate bias correction vector f3j for each template. 
For any given candidate the size and direction of 
each template's bias correction would be based 
on the unique pattern of integrated membership 
grades: r = {71, 72, . . . Jn}- 



rection, we need to scale the (3 values separately for 
each candidate SN according to its similarity with 
the calibration object. For this purpose, we can 
use the set of all integrated membership function 
values from Eq. [5] T = {71, 72, ■ • ■ 7jv}- As a first- 
order approximation, the bias correction function 
for model Mi would be: 



/?i(r) = bn7! +b 12 7 2 



t>iN7N (11) 



Here we have introduced the parameters bjk, 
which are two-dimensional constants that carry 
units of z and /i e If we have a candidate that is 



a 100% match to model Mi then 71 = 1 and the 
611 parameter by itself would define the necessary 
bias correction. If instead the candidate has mul- 
tiple non-zero 7 values for model Mi would be a 
weighted sum of the bjk parameters. 

Each template Mj needs its own scalable bias 
correction function (3j. We can describe the com- 
plete set of bias correction functions for all N tem- 
plates in matrix form: 



111 ' 




' bn 


bi2 ■ 


biN 




71 






&21 


&22 ■ 






72 


Pn _ 






bN2 ■ 


bNN 




7w 



(12) 

From this setup we can determine the appropri- 
ate values for the bjk parameters using a large 
calibration set of SNe with good light curves and 
spectroscopic redshifts. For each calibration SN, 
we would build up the single-template member- 
ship functions, and integrate them using Eq. [5] 
We would then measure the distance from the 
g(z, /j, e \Mj) peak to the SN's true location in order 
to get a j3j value for each template. Every individ- 
ual calibration SN thus provides an estimate for all 
components of the (3 and T vectors. If the set of 
calibration SNe contains at least as many SNe as 
the template library, then the system of N equa- 
tions in Equation Q2] can be solved for the optimal 
set of bjk values. One possible source for calibra- 
tion SNe would be to use a low-reds hift Hubble 
flow training set (e.g. Jha et al. 19991 ) comprised 
of SNe that are far enough away to have small 
peculiar velocity errors (cz > 2500 km s _1 ) but 
near enough that non-linear cosmological effects 
are negligible (cz < 30, 000 km s _1 ). 

This proposal is very simplistic, and may in fact 
be insufficient to account for the complexities of a 
real bias function. It is quite possible that the 
biases from individual templates are too inconsis- 
tent to allow for such a simple linear decomposi- 
tion. One would need to see whether a consistent 
solution of Eq. [12] is possible, and whether the ap- 
plication of a bias correction vector is actually able 
to improve the results for a verification data set, 
such as the SDSS and SNLS SNe used here. 



They could just as well include Ay and t pk , but we have 



assumed that those are being marginalized out as nuisance 
parameters. 
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5.2. Peer to Peer Cosmology 

An alternative method for removing location 
parameter biases is to use an approach that 
is based on selective comparisons. The SOFT 
method is founded on the assumption that two ob- 
jects with very similar physical parameters •& will 
necessarily have very similar light curve shapes. 
The biases that we are trying to remove arise 
primarily due to light curve shape mismatches be- 
tween the template and the candidate SN. Thus, 
if we could identify a group of SNe that have very 
similar 3? values, then we would expect a direct 
comparison between group members to be largely 
free of bias. 

One possibility would be to select candidate 
SNe that have very close matches in the template 
library. For example, we could pick out only those 
candidate objects that have a very strong match 
with a single template by elevating our template 
selection threshold to 7 > 90% or 95%. However, 
with only a few dozen templates in the library 
most SN candidates will not have a 90% match 
with any template, so the available sample size 
would be severely reduced. 

A modification of this approach could overcome 
the limitations of a sparse template library by 
leveraging the very large SN data sets to come 
from future surveys. Suppose that a candidate 
SN X is observed at an unknown location Ox and 
we compare it against our N templates to get 
a vector of integrated membership grades Tx = 
(7x1, 1x2, ■ ■ ■ Ixn)- A second candidate SN Y ex- 
ists at a different location Oy and has a differ- 
ent set of membership grades Ty. If these two 
objects have very similar physical characteristics, 
<& x ~ *&y, then we should expect that the same 
templates that are good matches for SN X will also 
be similar to SN Y, so that Tx ~ Ty . 

This provides a straightforward means for col- 
lecting a group of similar SNe, which we will call a 
"peer group." We compare all the available candi- 
dates against the template models, and then group 
together candidates that have a very similar pat- 
tern of membership grades T. The members of 
this peer group generally will not have a perfect 
match in the template library, so no single tem- 
plate can provide an unbiased estimate for any of 
the group members. However, we do expect that 
any member of this peer group could provide an 



unbiased estimate for the location parameters of 
any other member in the group. All that is needed 
is to redefine the candidate SN members to take 
the place of the template library. 

Recognizing that absolute distances are not 
necessary for cosmology, let us reduce the loca- 
tion vectors for our two peer SNe X and Y down 
to a pair of three-dimensional sub-vectors fix = 
{z x ,A Vt x,tpk,x) and i?y = (zy , A Vt y , t pk ,y). 
The procedure for executing a peer to peer com- 
parison is as follows: 

1. Fit a smooth spline curve to the photometric 
data of SN X, allowing us to interpolate to 
any point in time. 

2. Make a guess for -d x = {zx,A v ,x,t p k,x) 

3. For every spectrum in the SOFT spectral li- 
brary (Paper I, §2.2), apply the zx redshift, 
add reddening according to Ay.x, and shift 
in time to match t p k } x- 

4. Iteratively warp the adjusted spectra to 
force agreement with the observed griz pho- 
tometry of SN X. 

5. Remove the effects of the assumed fix from 
the warped spectra (i.e. apply a reverse time 
shift, dereddening, and a blue-shift) to get 
a rest-frame spectrophotometric model for 
SN X. 

6. Repeat steps 1-5 for SN Y to get a rest- 
frame spectrophotometric model of SN Y 
under the assumption of a location i9y = 

(zy, Ay,Y, t p k,y)- 

At the end of this process we will have a pair 
of rest- frame spectrophotometric models /x(A,i) 
and fy (A, t) giving the flux as a function of wave- 
length and time for both SN X and SN Y, at the 
assumed locations "dx and i9y. We now need 
to consider the relative luminosity distances for 
these two objects. Assuming that membership 
in the same peer group indicates that these two 
objects are physically almost identical, then they 
should have nearly the same intrinsic luminosity. 
This means that their relative luminosity distances 
Dl x and Dj^ y are encoded in the flux ratio: 
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With the addition of TZf we now have seven free 
parameters that define our rest-frame models for 
both objects. If the two SNe are in fact physically 
similar, and if we have guessed correctly about 
fix, fly and TZf, then we should expect to see the 
same flux values as a function of wavelength and 
time: /x(A,i) = TZf ■ fy(X,t). This means that 
we can compute a posterior likelihood for each set 
of model parameters in the same manner that was 
used for Eq. [T] 



JY 



p(D\ # x ,# Y ,TZ f )=Yl 



exp 



i=i ' (14) 



2t- 



Here we are computing a flux difference fx — Rffy 
at N points in time and wavelength! 11 ! The flux 
uncertainty term, <Xj could be computed as a 
quadratic sum of uncertainties from the SN X and 
SN Y models, accounting for the true observa- 
tional errors for both SNe as well as additional 
error introduced by the spline fitting and spectral 
warping steps described above. 

The posterior probability of Eq. [T?] can now 
be treated in the same manner as the probabil- 
ities derived for the template-based SOFT com- 
parisons in Section [2] After setting priors for all 
the components of "dx, fly and TZf , we can apply 
Bayes' theorem as in Eq. [2j The resulting prob- 
ability distribution can be marginalized over nui- 
sance parameters to get a three-dimensional func- 
tion in (z x ,zy ,TZf) space, which provides a direct 
constraint on any cosmological model. 

There are several advantages to be gained by 
adopting this type of strategy for measuring cos- 
mological parameters without spectroscopy. In 
addition to the bias minimization benefit, using 
peer to peer comparisons could provide a more ro- 
bust model for the rest-frame ultraviolet light. For 
example, the rest-frame U band light from a SN 



11 In theory, N could be an exceedingly large number, as the 
rest-frame models for SN X and SN Y could encompass sev- 
eral dozen points in time and several hundred distinct wave- 
lengths. To reduce the computational complexity, it would 
be advisable to integrate the spectrophotometric models 
into a handful of broad bandpasses in the visual wavelength 
range (these could be arbitrary box-car functions or real fil- 
ter transmission curves), and to select a few representative 
time points for comparison (perhaps t=-5,0,+10,+30 days 
from the peak). 



at redshift z>0.2 will be observed in visual band- 
passes (V or g). Thus, the g-band light curve of 
a z=0.2 template drawn from the Pan-STARRS 
survey is sufficient for comparison to the r-band 
light curve of a z=0.5 SN from the same survey. 
This approach also provides a natural set of inter- 
nal consistency checks. As SN surveys begin to 
collect thousands of TN SN light curves, it may be 
possible to define peer groups with 10 or 20 mem- 
bers that all have very strongly similar light curve 
shapes. Each direct comparison between two ob- 
jects in a peer group will provide an estimate of 
the redshifts for both objects. The consistency 
of redshift estimates for each object in the group 
provides a useful verification that the peer group 
is indeed defining a homogeneous set. 

5.3. Spectral Library Bias 

In addition to the biases of a sparse template 
library discussed above, SOFT may also suffer 
from systematic biases introduced by the spec- 
tral libr ary. We have use d the spectral tem- 
plates of Hsiao et al l ( 2007) for n o rmal Type la 
SNe, and those of iNugent et al.l (|2002[ ) to de- 
scribe overluminous 91T-like SNe and underlumi- 
nous 91bg-like SNe. It has been suggested that 
the spect ral properties of TN SNe may evolve with 
redshift dBlondin et al1l200d iBrqnder et al.112008 : 



Foley et al.ll2008at ISullivan et al.ll2009l) . in which 
case the use of these spectral templates based pri- 
marily on low-z SNe becomes inappropriate. This 
problem is particularly acute in the ultraviolet, 
where there can be substantial spectral variation 
([Ellis et al.l 120081 ). and where it is especially diffi - 
cult to collect a local sample (jFolev et al.ll2008bf ). 
The SALT2 light curve fitter addresses the UV 
problem by incorporating high redshift spec tra to 
define the rest- frame UV (|Guv et al.ll2007l ). Ex- 
tending the SOFT spectral library to avoid possi- 
ble bias will be especially important when SOFT is 
applied for parameter estimation of TN SNe with 
z> 0.8, where the rest-frame UV dominates the 
visible bandpasses. 

6. Summary 

In Paper I we introduced the SOFT program, 
using a set of fixed-shape light curve templates 
and the framework of fuzzy set theory for com- 
bining results from multiple templates. We ap- 
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plied SOFT as a SN classification tool in Paper I, 
and in this companion paper we have shown how 
SOFT can also be used to estimate parameters of 
cosmological interest: redshift z and luminosity 
distance /i e - The SOFT method is distinct from 
light curve fitters such as MLCS and SALT in that 
it does not describe the light curve shape with a 
parameterized model. Given this fundamental dis- 
tinction, SOFT may provide a valuable addition to 
the parametric fitters in that it could have smaller 
— or at least different — systematic uncertainties. 

Using Type la SN light curves from the SDSS 
and SNLS data sets, we have performed a set 
of verification tests to demonstrate the accuracy 
of SOFT. We found that the SOFT redshift es- 
timates are comparable to optical photo-z mea- 
surements for SDSS galaxies: both methods yield 
Sz = zsoft — z spec residuals with an RMS scatter 
of approximately 0.05. Applying SOFT to derive 
(z,/x e ) coordinate pairs for a joint SDSS+SNLS 
sample, we find an RMS scatter around the ACDM 
model of 0.18 mags in /i e when including spectro- 
scopic redshift priors. When SOFT analyzes the 
SDSS+SNLS sample with no spectroscopic infor- 
mation at all, we find the RMS scatter of Hubble 
residuals increases to 0.37 mags. 

To investigate the near future of SN cosmol- 
ogy, we considered the degree to which a variable 
DE model can be constrained by larger SN sam- 
ples containing thousands of light curves but very 
little spectroscopic follow-up. Using a bootstrap 
Monte Carlo approach, we simulated the distance 
and redshift estimates that might be obtained by 
SOFT for 9 different survey structures. We find 
that SOFT should be able to improve the DETF 
FoM by a factor of 2-3 with each new generation 
of SN surveys. 

The SOFT program has not yet realized the 
full potential of its fuzzy logic approach, but 
we have outlined several pathways for improving 
the method and limiting systematic biases. We 
have proposed a straightforward method for us- 
ing a training set to calibrate the template li- 
brary, and have also presented a "Peer to Peer 
Cosmology" approach in which SOFT can iden- 
tify groups of similar SNe and do a direct com- 
parison to determine their redshifts and relative 
distances. This latter method may be especially 
applicable in upcoming SN surveys such as Pan- 
STARRS and LSST, which will have an abundance 



of well-sampled multi-color light curves, but com- 
paratively little spectroscopic followup. 

Acknowledgments: We would like to thank 
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cal reading of this work, which led to substantial 
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